Quark and Gluon Tagging at the LHC 



Jason Gallicchio and Matthew D. Schwartz 
Department of Physics Harvard University Cambridge, MA 02138, U.S. A 

Being able to distinguish light-quark jets from gluon jets on an event-by-event basis could signifi- 
cantly enhance the reach for many new physics searches at the Large Hadron Collider. Through an 
exhaustive search of existing and novel jet substructure observables, we find that a multivariate ap- 
proach can filter out over 95% of the gluon jets while keeping more than half of the light-quark jets. 
Moreover, a combination of two simple variables, the charge track multiplicity and the pr-weighted 
linear radial moment (girth), can achieve similar results. While this pair appears very promising, 
our study is only Monte Carlo based, and other discriminants may work better with real data in 
a realistic experimental environment. To that end, we explore many other observables constructed 
using different jet sizes and parameters, and highlight those that deserve further theoretical and 
experimental scrutiny. Additional information, including distributions of around 10,000 variables, 
can be found on the website http://jets.physics.harvard.edu/qvg 



The Large Hadron Collider at CERN produces billions 
of jets a second. Understanding these jets may be the key 
to unraveling physics beyond the standard model. The 
jets at the LHC can be coarsely partitioned into quark 
or gluon jets. Almost always, the jets we would like most 
to study are quark jets and so removing the gluon jet 
background, if possible, could greatly enhance the search 
reach for many new physics scenarios. In this letter, we 
show that through a comprehensive study of light quark 
(uds) vs gluon discriminants, many of which take advan- 
tage of the improved resolution of the LHC detectors, 
gluon-tagging with good efficiency is achievable on an 
event- by- event basis. 

There are many situations in which quark/gluon tag- 
ging would be useful. For example, the jets produced in 
supersymmctric decay chains are usually entirely quark 
jets, while their backgrounds are mostly gluon jets. 
Quark tagging would be especially helpful in cases like R- 
parity violating SUSY where there are no additional han- 
dles like leptons, photons or missing energy. Even when 
there are leptonic decay modes, quark tagging could help 
measure the hadronic branching ratio of new physics, 
which will be needed to verify the model. Interesting 
standard model physics also tends to be quark-heavy. 
For example, vector-boson fusion requires the tagging of 
two forward jets which are always quarks, while back- 
ground jets in this forward region tend to be dominated 
by gluons. Alternatively, there are some scenarios where 
the new physics is in gluon jets (e.g. coloron models 
which produce 4 or more gluon jets), for which gluon 
tagging would also help. 

Quark and gluon jets are an extremely useful abstrac- 
tion, discussed in hundreds of papers and many experi- 
mental studies, despite their not having a precise theo- 
retical or experimental definition Any flavor tagging 
is only meaningful to the extent that there is a corre- 
spondence between hard partons and jets, which is the 
standard starting point for almost every collider search 
involving hadronic final states. The correspondence is af- 
fected by things like the jet algorithm, the event's topol- 



ogy, and the distance between energetic deposits. In this 
paper, quark or gluon jets refer to the parton which is 
produced in the hard process at leading order in pertur- 
bation theory and initiates the parton shower. At this 
level there is no ambiguity in what is meant by the jet 
flavor since there is no interference between different final 
states. In fact, the flavor is well-defined to to all orders 
in perturbation theory up to the same power corrections 
that affect any collinear and IR safe jet algorithm's par- 
ton correspondence Q. These power corrections involve 
the jet size R (equivalently the jet's mass-to-energy ratio 
m/E) and Aqcd/^- 

In this paper, we study pure quark or gluon samples 
and only consider observable properties of these jets. The 
expectation is that properties of jets coming from, say, 
SUSY decays will be more similar to the properties of 
the quark jets in our simulation, despite the fact that jet 
properties are not expected to be completely universal - 
jet substructure is affected by things like adjacent jets 
and differing color-connections. The result of this paper 
is a rough ranking of jet observables that can be used to 
distinguish light flavor and help find new physics signals. 
Once promising observables are found and their distribu- 
tions measured, it might be possible to construct an even 
more powerful tagger. 

Gluon tagging at the LHC is both more useful and 
achievable than at the Tevatron. The LHC's proton- 
proton initial state, higher energy, and higher luminosity 
increase the number of jets produced and make gluon jet 
backgrounds more common. CMS's particle flow [I[ and 
Atlas's individually calibrated TopoClusters @, along 
with each detector's improved calorimeter resolution, al- 
low unprecedented measurement of the energy and track 
distribution within jets. As we will see, the better the 
resolution on the jet constituents, the better the tagger 
will be. We also find that the higher pt jets of the LHC 
are more reliably tagged than lower px jets of previous 
colliders, as long as the tracking remains reliable. 

Much is known experimentally about quark and gluon 
jets (see [f| for a summary). One important LEP result 



was that 6-jets were more similar to gluon jets than to 
light-quark jets 0, Q : due to the longer decay chain of 
B-hadrons, the number of particles and angular spread 
is larger for a 6-jet than a light-quark jet. The similarity 
of 6-jets to gluon jets should be lessened in the LHC's 
higher px jets because the QCD shower produces more 
particles, whereas the particle multiplicity is relatively 
fixed in the S-hadron decay. There are already sophis- 
ticated and very detector-specific methods for 6-tagging. 
Current 6-taggers rely mostly on impact parameters or 
a secondary vertex, so they arc independent of the ob- 
scrvables we consider. Therefore, we restrict our study 
to discriminating light quarks (uds) from gluons. 

The accumulated knowledge from decades of experi- 
ments and perturbative QCD calculations have been in- 
corporated into Monte Carlo event generators, in par- 
ticular Pythia Q and Herwig These programs also 
include sophisticated hadronization and underlying event 
models which have also been tuned to data. Small dif- 
ferences still exist between these tools (and between the 
tools and data), but they provide an excellent starting 
point to characterize which obscrvables might be useful in 
gluon-tagging. The approach to gluon-tagging discussed 
here is to find observables which appear promising and 
then can be measured and calibrated on samples of mixed 
or pure quark or gluon jets at the LHC 0] ■ 

To understand the structure of a jet, it is important to 
distinguish obscrvables which average over all events from 
observables which are useful on an event- by- event basis. 
One example of an averaged observable is the classic inte- 
grated jet shape, ^(r), which has already been measured 
at the LHC [ll[. This jet shape is defined as the frac- 
tion of a jet's pt within a cone of radius r. Tradition- 
ally, jet shapes are presented as an average over all jets 
in a particular px or 77 range. For any r, the averaged 
jet shape becomes a single number, which is generally 
larger for quarks than for gluons because a greater frac- 
tion of a typical quark jet's pt is at the center of the 
jet. On traditional jet shape plots, error bars for each r 
are proportional to the standard deviation of the under- 
lying distribution, but that distribution is not a narrow 
Gaussian around the average. For example, the event-by- 
event distributions for $>(r = 0.1) are shown in Figure Q] 
for quarks and gluons. Jet shapes averaged over these 
distributions throw out useful information about the lo- 
cation and pt's of particles within the jet, along with 
their correlations. For event- by- event discrimination, it 
is crucial to have distributions, whereas most public data 
only describes averages. In this study we consider ^/(r) 
and many other variables to see which are best suited to 
quark/gluon tagging. 

To generate samples of quark and gluon jets we con- 
sidered samples of dijet events and 7+jet events. These 
were generated with MADGRAPH V4.4.26 [l2[ and show- 
ered through both pythia V8.140 [§] and HERWIG++ 
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^(r = 0.1) for 200 GeV Jets 




Integrated Jet Shape \I/(0.1) 



FIG. 1: Data on the integrated jet shape *&(?-) is usually 
published only when averaged over all events. Here we show 
the distribution of $(0.1), for quarks (blue, solid) and gluons 
(red, hollow). The event-by-event distributions of ^(r) and 
other observables are much more important for gluon tagging 
than average values. 

V2.4.2 fllj with the default tunes. Jets, reconstructed 
using fast jet V2.4.2 were required to have | ^7 1 < 1. 

We needed to isolate samples of quark and gluon jets 
with the similar jet px's. Unfortunately, we cannot get 
similar jet px's by having similar px's at the hard parton 
level, since the showering changes the px significantly, 
and differently for quarks and gluons. This is an unphys- 
ical difference, since the parton px is set artificially, and 
we have to avoid our tagger picking up on it. The solution 
we chose was to generate and shower a wide spectrum of 
dijet and 7+jet events, and require the resulting Anti-fc^ 
R=0.5 jets to lie within 10% of the central value for each 
of sixpT windows, centered around 50, 100, 200, 400, 800, 
and 1600 GeV. (The underlying hard partons spanned a 
range from half to twice the central value.) The pt spec- 
trum within each window matches the falling spectrum of 
the underlying dijet or 7+jet samples, which are nearly 
identical for quarks and gluons in narrow windows cho- 
sen. When the entire event is reclustered with a different 
jet size, as was done when examining how the observables 
change with R, the resulting jet px no longer necessarily 
lies within the narrow ±10% window. In fact, how the 
jet Pt changes with R forms a quark/gluon discriminant 
similar to integrated jet shape. 

With each sample of similar-p-p jets, there are two main 
types of observables useful in separating quarks from glu- 
ons: discrete ones, which try to distinguish individual 
particlcs/tracks/subjcts, and continuous ones that can 
treat the energy or px within the jet as a smooth func- 
tion of (8rj, S(f>) away from the jet axis in order to form 
combinations like geometric moments. 

The discrete category includes the number of distin- 
guishable tracks, small subjets, or reconstructed parti- 
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cles. Functions of this information, such as the average 
or the spread (standard deviation) of their px's were con- 
sidered. We find that this class of observables provide the 
best discrimination at high quark efficiency (mild cuts) 
and high jet px- 

The average multiplicity of any type of particle, along 
with its variance, are sensitive to the QCD charges of 
the underlying gluon (Ca = 3) or quark (Cf = 4/3). To 
leading order, 



(N g ) 
(N q ) 



Ca 
C f 



and 



Ca 
C f ' 



(1) 



The OPAL collaboration, among others, studied the 
charged particle multiplicity in light-quark and gluon jets 
of energy around 40 GeV to 45 GeV 14j and found dis- 
tributions that agree well with the Monte Carlo event 
generators and with analytic predictions. 

We find that the strongest discrete observable is the 
number of charged particles within the jet, where charged 
particles were required to have px > 500 McV. Lower 
cuttoffs actually lead to better discrimination power, so 
how well the LHC detectors will be able to resolve particle 
Pt will have important consequences for gluon tagging. 

Another discrete observable is the subjet multiplicity, 
which was also studied at LEP 15 , 1|| . Extremely small 



subjets approach the limit of particles and are sensitive 
to hadronization, but larger subjets probe the better 
modeled, perturbative physics and give the largest ra- 
tio between quark and gluon subjet multiplicities. For 
the higher-energy jets of the LHC, the optimal jet size 
is far smaller than the calorimeter resolution. We found 
that counting R su b=0.1 anti-Air jets was more powerful 
than other subjet algorithms and larger sizes, but not 
as powerful as counting the number of charged tracks. 
Small subjet multiplicity can serve as a reasonable sub- 
stitute if charged track multiplicity proves less reliable in 
some circumstances (perhaps at very high 77). Counting 
all hadrons works even better than charged tracks. 

Other observables in the discrete category that show 
reasonable discrimination power include the average dis- 
tance to jet axis (r), the px fraction of the N th hard- 
est track or subjet, and the subjet splitting scale (when 
the jet is reclustered with the kx algorithm). Finally, 
there are observables that take advantage of the electri- 
cal charge that quarks carry. Since the hardest hadrons 
produced at the end of the shower have charges corre- 
lated with the initiating quark, adding up the charges of 
all tracks weighted by their px gives some small discrim- 
ination. 

The second, more continuous, category of observables 
includes jet mass, jet broadening [17j ]. and the family of 
radial moments like girth 18 1, angularities [l||, and the 
optimal moment which are described below. These tend 
to perform better at lower jet px and for achieving high 
quark purity through harsh cuts. Other observables that 



try to capture the 2D shape or color connections of the 
jet, like pull [2(J, eccentricity, or planar flow [2l[, are less 
powerful in this application. 

We find the best single observable in the continuous 
category is the linear radial moment - a measure of the 
'width' or 'girth' of the jet - constructed by adding up 
the px deposits within the jet, weighted by distance from 
jet axis. It is defined as 



Linear Radial Moment (Girth): 
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where r.j = yj Ayf + A(f>f and where the true boost- 
invariant rapidity y should be used for the (massive) jet 
axis instead of the geometric pseudorapidity 77. Under the 
assumption of central jets with massless constituents at 
small angles, this linear moment is identical to jet broad- 
ening, defined as the sum of momenta transverse to the 
jet axis normalized by the sum of momenta. While jet 
broadening is natural at an e + e~ collider, the linear ra- 
dial moment is more natural and works a bit better at the 
LHC. Other geometric moments involving different pow- 
ers of r were not as powerful at discriminating quarks 
from gluons, including the jet mass, which is equal to 
the r 2 geometric moment in the same limit. 

By weighting the px by other functions of r, whole 
families of radial-kernel observables can be constructed: 
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Angularities [19|, [2l| are one such example, where the px 
and r arc usually replaced by energy and angle. Angu- 
larities are often normalized by the jet mass rather than 
the jet px, and we considered both normalizations. Both 
angularities and kernels which are powers of r suffer from 
sensitivity to the edge of the jet where their kernels arc 
greatest. This becomes problematic in crowded environ- 
ments with adjacent jets. 

Rather than try to guess a useful kernel, we attempted 
to optimize its shape numerically. By parameterizing the 
kernel as a spline with 5 to 10 points, a genetic algorithm 
was used to maximize gluon rejection for several differ- 
ent quark efficiencies. In all cases, the optimal kernels 
rose linearly from the axis of the jet out to r ~ 0.3, then 
turned over and decreased smoothly to zero at the edge 
of the jet, but the gluon rejection in call cases was rather 
insensitive to the region away from the center. These 
optimal kernels performed slightly better than the lin- 
ear radial moments, but not enough to justify additional 
focus here. 

By looking at combinations of observables, additional 
quark/gluon discrimination is achieved. The 2D his- 
tograms for the best discrete and continuous observables, 
charged particle count and the linear radial moment, are 
shown in Figure [5] While the two observables are corre- 
lated, it is still helpful to use both. In the third panel 
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Quark Gluon Likelihood: q/(q + g) 




FIG. 2: 2D Histograms of the two best observables, along with the likelihood formed by combining them bin-by-bin. 



of this figure, we show the 2D bin-by-bin likelihood dis- 
tribution. Given these variables, the discriminant that 
achieves optimal gluon rejection for a fixed quark effi- 
ciency is a simple cut on the appropriate likelihood con- 
tour. Cutting out the top-right corner, for example, elim- 
inates the most egrcgiously gluey jets. In practice, this 
can be pre-computed or measured in each jet px window. 
As part of jet energy scale calibrations, Atlas 22j has 
measured these two variables in dijet, 7-jet, and multi- 
jet samples and used them individually to determine the 
flavor composition to 10% precision. 

The same method can be applied for more than 2 ob- 
servables, but then the exact likelihood becomes impos- 
sible to map efficiently with limited training samples. A 
multivariate technique like Boosted Decision Trees can 
be employed to approximate this multidimensional like- 
lihood distribution, as explained in [iij . 

In summary, quite a number of single variables do com- 
parably well, while some (like pull or planar flow) do 
quite poorly at gluon tagging. We examined many com- 
binations of observables, and found significant improve- 
ment by looking at pairs, but only marginal gains be- 
yond that. The results for the gluon rejection as a func- 
tion of quark efficiency are shown for a number of the 
more interesting observables and combinations in Fig- 
ure [3] for 200 GeV jets. The relative performance of 
variables changed little with px even though the op- 
timal cuts do. Definitions and distributions of these 
variables, and thousands of others, can be found on 
|http : // j ets . physics . harvard . edu/ qvg. Good pairs 
of variables included one from the discrete category de- 
scribed above, such as particle count, and one more con- 
tinuous shape variable, like the linear radial moment 
(girth). 

As an example using these curves to estimate the im- 
provement in a search's reach, consider X — > WW — > 
qqqq whose background is mostly 4-jets from QCD, each 
of which is a gluon 80% of the time Q. By operating at 
60% quark efficiency, only 1 /10 th of gluons pass the tag- 
ger, which means (20%) 4 of the total QCD background 
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FIG. 3: Gluon rejection curves for several observables as a 
function of Quark Jet Acceptance. The results for 200 GeV 
Jets are shown, but other samples give similar results. The 
best pair of observables is charged track multiplicity and lin- 
ear radial moment (girth). The best group of five also includes 
jet mass for the hardest subjet of size R=0.2, the average kr 
of all R sub =0.1 subjets, and the 3rd such small subjet's pr 
fraction. 



passes. One measure of statistical significance in a count- 
ing experiment is S/ vB, perhaps within a particular in- 
variant mass window. Any starting significance can be 
improved by a factor of 3.2 using these cuts. The 60% 
operating point was chosen to maximize this significance 
improvement for this particular background composition, 
which highlights the need to characterize background re- 
jection for all signal efficiencies. 

Measurements of these variables are underway, but it 
would be very interesting to see distributions of and cor- 
relations between as many of the variables in Figure [3] 
as possible. To this end, it has recently been observed 
that 99% pure samples of quark jets can be obtained in 
7+2jet events, and 95% pure samples of gluon jets can be 
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obtained in 3-jet events [jj. These samples could provide 
a direct evaluation of the tagging technique at all jet pts, 
verify and help improve the Monte Carlo generators, and 
provide a test of perturbative QCD. 
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